Abstract

Let $(M, g_1)$ be a compact $d$-dimensional Riemannian manifold for $d > 1$.
Let $\mathcal{X}_n$ be a set of $n$ sample points in $M$ drawn randomly from a smooth
Lebesgue density $f$, bounded away from zero. Let $x, y$ be two points in $M$. We
prove that the normalized length of the power-weighted shortest path between
$x, y$ through $\mathcal{X}_n$ converges to the Riemannian distance between $x, y$ under the
metric $g_p = f^{2(1-p)/d} g_1$, where $p > 1$ is the power parameter.

1 Introduction 

The shortest path problem (see e.g., Cormen et al. (2009); Dijkstra (1959)) is of 
interest both in theory and in applications since it naturally arises in combinatorial 
optimization problems, such as optimal routing in communication networks, and 
efficient algorithms exist to solve the problem. In this paper, we are interested in the 
shortest paths over random sample points embedded in Euclidean and Riemannian 
spaces. 

Many graph structures over Euclidean sample points have been studied in the 
context of the Beardwood-Halton-Hammersley (BHH) theorem and its extensions. 
The BHH theorem states that a law of large numbers (LLN) holds for certain spanning
graphs over random samples. Such graph structures include the travelling salesman
path (TSP), the minimal spanning tree (MST), and the nearest neighbor graphs
($k$-NNG). See Steele (1997) and Yukich (1998). The BHH theorem applies to graphs
that span all of the points in the random sample. This paper establishes a BHH-type
theorem for shortest paths between any two points.

In the last few years, the asymptotic theory for spanning graphs such as the
MST, the $k$-NNG, and the TSP has been extended to the Riemannian case; e.g., Costa
and Hero (2004) extended the MST asymptotics in the context of entropy and
intrinsic dimensionality estimation. More general non-Euclidean extensions have been
established by Penrose and Yukich (2011). This paper extends the BHH theorem 
in a different direction: the shortest path between random points in a Riemannian 
manifold. 

The asymptotic properties of paths through random Euclidean sample points 
have been studied mainly in first-passage percolation (FPP) models (Hammersley 
1966). Shortest paths have been studied in FPP models in the context of first passage 
time or travel time with lattice models (Kesten 1987) or (homogeneous) continuum 
models (Howard and Newman 1997; Howard and Newman 2001). Under the FPP 
lattice model, LaGatta and Wehr (2010) extended these results to the non-Euclidean 
case where interpoint distances are determined by a translation-invariant random 
Riemannian metric in $\mathbb{R}^d$. This paper makes a contribution in a different direction.
We assume a non-homogeneous continuum model and establish convergence of the 
shortest path lengths to density-dependent conformally deformed Riemannian dis- 
tances. The convergent limit reduces to the result of Howard and Newman (2001) 
when specialized to a homogeneous Euclidean continuum model. 

2 Main results 

In this paper, a differentiable function is an infinitely differentiable function. A 
smooth manifold means its transition maps are smooth. 

Let $(M, g_1)$ be a smooth $d$-dimensional Riemannian manifold without boundary
with Riemannian metric $g_1$ and $d > 1$. When $M = \mathbb{R}^d$, we fix $g_1$ as the standard
Euclidean metric. The use of the subscript for $g_1$ will become clear shortly.

Consider a probability space $(M, \mathcal{B}, P)$ where $P$ is a probability distribution
over Borel subsets $\mathcal{B}$ of $M$. Assume that the distribution has a Lebesgue probability
density function $f$ with respect to $g_1$. Let $X_1, X_2, \ldots$ denote an i.i.d. sequence drawn
from this density. For convenience we denote this sequence by $\mathcal{X}_n = \{X_1, \ldots, X_n\}$.
The sequence $\mathcal{X}_n$ will be associated with the nodes in an undirected simple graph
whose edges have weight equal to the power-weighted Euclidean distance between
pairs of nodes. Observe that we will also index generic non-random points
$x_n \in M$ by $n$. Such a point is not related in any way to the random variable $X_n$. For
realizations, we will use the notation $\mathcal{X}_n(\omega)$ where $\omega$ is an elementary outcome in
the sample space.

For $p > 1$, called the power parameter, define a new conformal Riemannian
metric $g_p = f^{2(1-p)/d} g_1$. That is, if $Z_x$ and $W_x$ are two tangent vectors at a point
$x \in M$, then $g_p(Z_x, W_x) = f(x)^{2(1-p)/d} g_1(Z_x, W_x)$. The deformed Riemannian
metric $g_p$ is well-defined for every $x$ with $f(x) > 0$. In this paper, we assume $p > 1$
except for a few places where we compare with the un-deformed case $p = 1$.

The main result of this paper, stated as Theorem 1, establishes an asymptotic 
limit of the lengths of the shortest paths through locally finite point processes. A
subset $A \subset M$ is locally finite if $A \cap B$ is finite for every $B \subset M$ of finite volume.
For example, a homogeneous Poisson process in $\mathbb{R}^d$ is locally finite with probability
one when $M = \mathbb{R}^d$. For $x, y \in M$ and locally finite $A \subset M$, let $L(x, y; A)$ denote
the power-weighted shortest path length from $x$ to $y$ through $A \cup \{x, y\}$. Here
the edge weight between two points $u$ and $v$ is $\mathrm{dist}_1(u, v)^p$ where $\mathrm{dist}_1$ denotes the
Riemannian distance under the metric $g_1$. For convenience, we use the shorthand
notation $L_n(x, y)$ for $L(x, y; \mathcal{X}_n)$.
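
The definition of the power-weighted shortest path length can be illustrated with a minimal Python sketch. It assumes the Euclidean case, where the edge weight between two points is the $p$-th power of their Euclidean distance; the function name `power_weighted_shortest_path` is hypothetical and the dense Dijkstra search is only one possible way to evaluate the quantity.

```python
import heapq
import math


def power_weighted_shortest_path(points, x, y, p):
    """Return (length, path) of the power-weighted shortest path from x to y
    through the finite set `points` together with {x, y} (Euclidean sketch)."""
    # Node 0 is x, node 1 is y, the remaining nodes are the sample points.
    nodes = [tuple(x), tuple(y)] + [tuple(q) for q in points]
    n = len(nodes)
    dist = [math.inf] * n
    prev = [None] * n
    done = [False] * n
    dist[0] = 0.0
    heap = [(0.0, 0)]
    while heap:
        d_u, u = heapq.heappop(heap)
        if done[u]:
            continue
        done[u] = True
        if u == 1:  # reached y
            break
        for v in range(n):
            if done[v]:
                continue
            # Edge weight is the Euclidean distance raised to the power p.
            w = math.dist(nodes[u], nodes[v]) ** p
            if d_u + w < dist[v]:
                dist[v] = d_u + w
                prev[v] = u
                heapq.heappush(heap, (dist[v], v))
    # Walk the predecessor chain back from y to x.
    path = []
    node = 1
    while node is not None:
        path.append(nodes[node])
        node = prev[node]
    return dist[1], path[::-1]
```

With $p = 2$ and a single sample point at the midpoint of $x$ and $y$, the two-hop path costs half of the direct edge, whereas with $p = 1$ the direct edge is never beaten.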

For $x \in M$ and $r > 0$, we denote by $B(x; r)$ the open ball in $M$ of radius $r$
centered at $x$, i.e., $B(x; r) = \{u \in M : \mathrm{dist}_1(x, u) < r\}$.



2.1 Main result 

The following is the main result of this paper. 

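Theorem 1. Assume that $M$ is compact, and that $f$ is smooth with $\inf_M f > 0$.
Let $0 < b < 1$ and $c > 0$ be constants. Then for every fixed $\varepsilon > 0$,

\[ \limsup_{n \to \infty} (n \inf f)^{(b-1)/(d+2p)} \log P\left\{ \sup_{x,y} \left| \frac{L_n(x,y)}{n^{(1-p)/d}\, \mathrm{dist}_p(x,y)} - C(d,p) \right| > \varepsilon \right\} < 0, \]

where the supremum is taken over $x, y \in M$ such that

\[ \mathrm{dist}_1(x,y) \ge c\, (n \inf f)^{-b/(d+2p)}, \]

and $\mathrm{dist}_p$ denotes the Riemannian distance under $g_p$. $C(d,p)$ is a positive constant
that only depends on $d$ and $p$.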

The constant $C(d,p)$ is fixed throughout this paper. This constant is also
denoted as $\mu$ in Howard and Newman (1997); Howard and Newman (2001).

An immediate consequence of the Borel-Cantelli lemma is the almost-sure
convergence of $L_n(x, y)$.

Corollary 2. Suppose that the conditions assumed in Theorem 1 hold. Let $x, y \in M$.
Then

\[ \lim_{n \to \infty} n^{(p-1)/d} L_n(x,y) = C(d,p)\, \mathrm{dist}_p(x,y) \quad \text{a.s.} \]

As in Theorem 1, $\mathrm{dist}_p$ denotes the Riemannian distance under $g_p$,

\[ \mathrm{dist}_p(x,y) = \inf_{\gamma} \int_0^1 f(\gamma_t)^{(1-p)/d} \sqrt{g_1(\gamma'_t, \gamma'_t)}\, dt, \]

where the infimum is taken over all piecewise smooth curves $\gamma : [0, 1] \to M$ such
that $\gamma(0) = x$ and $\gamma(1) = y$.
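
The integral in the definition of the deformed distance can be approximated numerically for a given curve. The following Python sketch assumes the Euclidean case (so that the $g_1$-speed of the curve is its Euclidean speed) and uses a midpoint rule on a polyline; the name `gp_curve_length` and the parameter `steps` are hypothetical. Since the distance is an infimum over curves, the value for any single curve is only an upper bound on the distance between its endpoints.

```python
import math


def gp_curve_length(f, curve, p, d, steps=1000):
    """Approximate the integral of f(gamma_t)^((1-p)/d) * |gamma'_t| dt over [0, 1]
    for a curve in R^d, using a midpoint rule on a polyline with `steps` pieces."""
    total = 0.0
    prev = curve(0.0)
    for i in range(1, steps + 1):
        cur = curve(i / steps)
        # Evaluate the density at the midpoint of the current polyline piece.
        mid = tuple((a + b) / 2 for a, b in zip(prev, cur))
        total += f(mid) ** ((1 - p) / d) * math.dist(prev, cur)
        prev = cur
    return total
```

For a constant density $f \equiv 4$ with $d = 2$ and $p = 2$, the conformal factor is $4^{-1/2} = 1/2$, so a unit straight segment has deformed length $1/2$; for $p = 1$ the factor is one and the Euclidean length is recovered.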

When $p = 1$, there is no power-weighting of the edges, $C(d, 1) = 1$ and the
shortest path length is the Riemannian distance $\mathrm{dist}_1(x,y)$ from $x$ to $y$.

2.2 Discussion



Theorem 1 can be compared to analogous results in the continuum FPP model of 
Howard and Newman (2001). The main differences are the following: (i) the results 
of Howard and Newman (2001) are restricted to the case of uniformly distributed 
node locations X n while our results hold for the important case of non-uniformly 
distributed nodes; (ii) our convergence rates improve upon those of Howard and 
Newman (2001). 

Specifically, Howard and Newman (2001, Theorem 2.2) show the following bound
on the shortest path lengths in a homogeneous Poisson point process. Let $\mathcal{L}_\lambda(x,y)$
denote the power-weighted shortest path length from $x \in \mathbb{R}^d$ to $y \in \mathbb{R}^d$ through
random nodes in a homogeneous Poisson point process $\mathcal{H}_\lambda$ of intensity $\lambda > 0$.
Let $\kappa_1 = \min(1, d/p)$, $\kappa_2 = 1/(4p + 3)$, and let $e_1 \in \mathbb{R}^d$ be a unit vector.
For any $0 < b < \kappa_2$, there exist constants $C_0$ and $C_1$ (depending on $b$) such that
for $t > 0$ and $t^b \le s\sqrt{t} \le t^{\kappa_2 - b}$,

\[ P\left\{ \left| \frac{\mathcal{L}_1(0, t e_1)}{t} - C(d,p) \right| > s \right\} \le C_1 \exp\left( -C_0 (s\sqrt{t})^{\kappa_1} \right). \tag{1} \]

Note that the fastest decay rate achievable by (1) is bounded above by an
exponential decay in $t^{\kappa_1 \kappa_2} = t^{\min(1, d/p)/(4p+3)}$.

On the other hand, our Corollary 8 implies, after simple Poissonization of the 
sequence X n , 



(2) 



lim sup t d + 2 p P 
A— »oo 



^A(0,tei)-C(d,p) 



> s } < 0, 



so that the decay is exponential in $t^{d/(d+2p)}$. Under the condition $d > 1$, $p > 1$, the
decay rate (2) is faster than the rate (1).
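
The comparison of the two rates reduces to comparing the exponents of $t$, namely $\min(1, d/p)/(4p+3)$ for (1) and $d/(d+2p)$ for (2). A small Python sketch makes the gap concrete; the helper names `hn_exponent` and `new_exponent` are hypothetical.

```python
def hn_exponent(d, p):
    """Exponent of t in the best decay rate available from (1)."""
    return min(1.0, d / p) / (4 * p + 3)


def new_exponent(d, p):
    """Exponent of t in the decay rate (2)."""
    return d / (d + 2 * p)


if __name__ == "__main__":
    for d in (2, 3, 10):
        for p in (1.5, 2.0, 5.0):
            print(d, p, round(hn_exponent(d, p), 4), round(new_exponent(d, p), 4))
```

For instance, with $d = 2$ and $p = 2$ the exponents are $1/11$ and $1/3$ respectively.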

It is useful to compare Theorem 1 with BHH results. The convergence result 
established in this paper differs from previous BHH theorems in two ways. The 
first difference is that Theorem 1 specifies a limit of the shortest path through X n 
while BHH theory (Steele 1997; Yukich 1998) specifies limits of the total length of 
a graph spanning X n , e.g., the minimal spanning tree (MST) or the solution to the 
traveling salesman problem (TSP). The second difference is that the shortest path 
has fixed anchor points, hence it is not translation-invariant. This is in contrast to 
BHH theory developed in Penrose and Yukich (2003) and Penrose and Yukich (2011) 
where Euclidean functionals are generalized to locally stable functionals while the 
translation-invariance requirement is maintained. 



3 Main proofs 

An obvious but important property of $L(x, y; A)$ for $x, y \in M$ and locally finite
$A \subset M$ is that if $A' \subset A$ then $L(x, y; A) \le L(x, y; A')$. This property is used in
several places in the proofs.

3.1 Local convergence results

Theorem 1 states a convergence result of random variables in manifolds. Theorem 1 
is obtained by extension of a simpler theorem on Euclidean space. 
We first prove an upper bound for shortest path edge lengths. 

Lemma 3. Let $z \in \mathbb{R}^d$ and $R > 0$. Assume that $X_1, \ldots, X_n$ is i.i.d. in $\mathbb{R}^d$ with
probability density function (pdf) $f$ and assume that there exists a constant $f_m > 0$
such that the pdf $f(u) \ge f_m$ for all $u \in B(z; R)$. Fix a number $\alpha \in (0, 1)$.

Define the event $H_n(i,j)$ for each pair $1 \le i \ne j \le n$ as the intersection of the
following events:

(i) both $X_i$ and $X_j$ are in $B(z; R)$,

(ii) $|X_i - X_j| > (n f_m)^{(\alpha-1)/d}$, and

(iii) the shortest path from $X_i$ to $X_j$ contains no sample point $X_k$ other than $X_i$
and $X_j$.

Let $F_n = \bigcap_{i,j} H_n(i,j)^c$, where the superscript $c$ denotes set complement. Then

\[ \limsup_{n \to \infty} \frac{1}{(n f_m)^{\alpha}} \log(1 - P(F_n)) < 0. \]

Proof. Suppose that the direct path $X_i \to X_j$ is not the shortest path between $X_i$
and $X_j$, i.e., there exists $k \ne i, j$ such that the path $X_i \to X_k \to X_j$ is shorter than
$X_i \to X_j$ as measured by the sum of power-weighted edge lengths:
$|X_i - X_k|^p + |X_k - X_j|^p < |X_i - X_j|^p$.

Define $h(X_i, X_j; \cdot) : \mathbb{R}^d \to \mathbb{R}$,

\[ h(X_i, X_j; u) = |X_i - u|^p + |X_j - u|^p - |X_i - X_j|^p, \tag{3} \]

and let $\Theta(X_i, X_j) = \{u \in \mathbb{R}^d : h(X_i, X_j; u) < 0\}$. Note that if $X_k \in \Theta(X_i, X_j)$,
then $X_i \to X_k \to X_j$ is shorter than $X_i \to X_j$. Note that the volume of $\Theta(X_i, X_j)$
is a function of the distance $|X_i - X_j|$ and that a certain proportion of $\Theta(X_i, X_j)$
intersects with $B(z; R)$. Therefore there exists a constant $\theta_1 = \theta_1(d, p) > 0$ such
that the intersection volume is at least $\theta_1 |X_i - X_j|^d$ for all sufficiently large $n$.

Suppose that event $H_n(1, 2)$ occurs. Then the shortest path from $X_1$ to $X_2$
contains no sample point other than $X_1$ and $X_2$, and the intersection of $\Theta(X_1, X_2)$
and $B(z; R)$ cannot contain any of $X_3, X_4, \ldots, X_n$. Since it is assumed that
$|X_1 - X_2| > (n f_m)^{(\alpha-1)/d}$,

\[ P(H_n(1,2)) \le \left( 1 - \theta_1 f_m^{\alpha} n^{\alpha-1} \right)^{n-2}. \]

There are $n(n-1)/2 < n^2$ pairs of sample points, hence

\[ 1 - P(F_n) = P\Big( \bigcup_{i,j} H_n(i,j) \Big) \le n^2 \left( 1 - \theta_1 f_m^{\alpha} n^{\alpha-1} \right)^{n-2}, \]

and the claim follows. □

Next we provide the following two results on the cardinality of the shortest
paths and on the mean convergence of $E L_n$. Their proofs require some results from the
theory of Poisson processes, and we defer them to Section 4.

Lemma 4. Let $z \in \mathbb{R}^d$, $R_2 > R_1 > 0$, and $\alpha = (d + 2p)^{-1}$. Assume that the pdf $f$
is uniform and supported in $B(z; R_2)$. Let $x_n, y_n$ be sequences in $B(z; R_1)$ satisfying
$\liminf_n (n f(z))^{\alpha} |x_n - y_n| = +\infty$.

Let $\#L_n(x_n, y_n)$ denote the number of nodes in the shortest path. Then there
exists $C_* > 0$ such that if $G_n$ denotes the event

\[ \frac{\#L_n(x_n, y_n)}{(n f(z))^{1/d} |x_n - y_n|} \le C_*, \tag{4} \]

then

\[ \limsup_{n \to \infty} \frac{1}{(n f(z))^{\alpha} |x_n - y_n|} \log(1 - P(G_n)) < 0. \tag{5} \]

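Proposition 5. Let $z \in \mathbb{R}^d$, $R_2 > R_1 > 0$, and $\alpha = (d + 2p)^{-1}$. Assume that the
pdf $f$ is uniform and supported in $B(z; R_2)$. Let $x_n, y_n$ be sequences in $B(z; R_1)$
satisfying $\liminf_n (n f(z))^{\alpha} |x_n - y_n| = +\infty$. Then

\[ \lim_{n \to \infty} \frac{E L_n(x_n, y_n)}{(n f(z))^{(1-p)/d} |x_n - y_n|} = C(d,p). \]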



From the previous three results, we obtain the following preliminary local con- 
vergence result. 

Proposition 6. Let $z \in \mathbb{R}^d$, $R_2 > R_1 > 0$, and $\alpha = (d + 2p)^{-1}$. Assume that the
pdf $f$ is uniform and supported in $B(z; R_2)$. Let $x_n, y_n$ be sequences in $B(z; R_1)$
satisfying $\liminf_n (n f(z))^{\alpha} |x_n - y_n| = +\infty$. Fix $\varepsilon > 0$ and let $E_n$ denote the event
that

\[ \left| \frac{L_n(x_n, y_n)}{(n f(z))^{(1-p)/d} |x_n - y_n|} - C(d,p) \right| < \varepsilon; \tag{6} \]

then

\[ \limsup_{n \to \infty} \frac{1}{(n f(z))^{\alpha} |x_n - y_n|} \log(1 - P(E_n)) < 0. \tag{7} \]

While it is possible to obtain a weakened form of Proposition 6 from Howard 
and Newman (2001), we provide an alternative proof with improved convergence 
rate. 

Proof of Proposition 6. Our proof is structured similarly to that of Yukich (2000,
Theorem 4.1) and Talagrand (1995, Section 7.1). Let

• $F_n$ be the event that all the shortest path link distances are at most $(n f(z))^{(\alpha-1)/d}$
(see Lemma 3 for $F_n$),

• $G_n$ be the event that $\#L_n(x_n, y_n) \le C_* (n f(z))^{1/d} |x_n - y_n|$ for the constant
$C_*$ specified in Lemma 4,

• $H_n$ be the event that for every point $u \in B(z; R_2)$, at least one of the sample
points is in $B(u; (n f(z))^{(\alpha-1)/d})$.

All these events occur with high probability. Both $1 - P(F_n)$ and $1 - P(G_n)$ are
exponentially small in $(n f(z))^{\alpha} |x_n - y_n|$ by Lemma 3 and Lemma 4, respectively.
The probability $1 - P(H_n)$ may be shown to be exponentially small as well by an
argument similar to the proof of Lemma 3.

We use shorthand notation $L_n$ for $L_n(x_n, y_n)$. For every $a > 0$, define $W_n(a)$ to
be the event that $L_n \ge a$. Let $\omega \in F_n \cap G_n$ and $\eta \in H_n \cap W_n(a)$ be two elementary
outcomes in the sample space. Let $\pi^*(\omega)$ be the shortest path $L_n(\omega)$ from $x_n$ to $y_n$
through the realization $\mathcal{X}_n(\omega) = \{X_1(\omega), \ldots, X_n(\omega)\}$. If $\pi^*(\omega)$ is the sequence

\[ x_n = \pi_0(\omega) \to \pi_1(\omega) \to \cdots \to \pi_k(\omega) = y_n, \]

where $k = \#L_n(\omega)$, then we may build a path $\pi(\eta)$ from $x_n$ to $y_n$ through
another realization $X_1(\eta), \ldots, X_n(\eta)$ as follows. For each $i \in \{1, \ldots, k-1\}$, let $j$
denote the index where $X_j(\omega) = \pi_i(\omega)$. If $X_j(\omega) = X_j(\eta)$, then set
$\pi_i(\eta) = \pi_i(\omega)$. Otherwise, since $\eta$ was assumed to be in $H_n$, there exists some $l$ such
that $X_l(\eta) \in B(\pi_i(\omega); (n f(z))^{(\alpha-1)/d})$. Set $\pi_i(\eta) = X_l(\eta)$. Then it follows that
$|\pi_i(\eta) - \pi_i(\omega)| \le (n f(z))^{(\alpha-1)/d}$ for all $i = 1, \ldots, k$.

Let $I$ be the set of indices $i$ where $\pi_i(\omega) \ne \pi_i(\eta)$. If $L(\pi)$ denotes the
power-weighted length of the path $\pi$, then
$L(\pi(\eta)) \le L(\pi^*(\omega)) + 2 |I| 3^p (n f(z))^{p(\alpha-1)/d}$
since $\omega \in F_n$. On the other hand, $\eta \in W_n(a)$ and hence

\[ L(\pi^*(\omega)) = L_n(\omega) \ge a - 2 |I| 3^p (n f(z))^{p(\alpha-1)/d}. \]

Let $d_c(\omega; H_n \cap W_n(a))$ be the convex distance of $\omega$ to $H_n \cap W_n(a)$ (see Talagrand
1995, Section 4.1). By Lemma 4.1.2 of Talagrand (1995), there exists $\eta \in H_n \cap W_n(a)$
such that $|I| \le d_c(\omega; H_n \cap W_n(a)) \sqrt{\#L_n(\omega)}$, and hence

\[ L_n(\omega) \ge a - 2 \cdot 3^p \cdot d_c(\omega; H_n \cap W_n(a)) \sqrt{\#L_n(\omega)}\, (n f(z))^{p(\alpha-1)/d}. \]

In particular, if $L_n(\omega) \le a - u$ for $u > 0$ then

\[ d_c(\omega; H_n \cap W_n(a)) \ge \frac{u\, (n f(z))^{(1-\alpha)p/d}}{2 \cdot 3^p \cdot \sqrt{C_* (n f(z))^{1/d} |x_n - y_n|}}, \tag{8} \]

since $\omega \in G_n$.

Let $M_n$ be the median of $L_n$. Set $a = M_n$ in (8) and apply Theorem 4.1.1 of
Talagrand (1995) to obtain, for $u > 0$,

\[ P\{L_n \le M_n - u\} \le 3 \exp\left( -\frac{C_2 u^2}{|x_n - y_n|} (n f(z))^{\frac{2p(1-\alpha)-1}{d}} \right) + (1 - P(F_n)) + (1 - P(G_n)), \]

where $C_2 = (2^4 3^{2p} C_*)^{-1}$, since $P(H_n)$ approaches one as $n \to \infty$. For an upper
bound, set $a = M_n + u$ in (8). Then, similarly,

\[ P\{L_n \ge M_n + u\} \le 3 \exp\left( -\frac{C_2 u^2}{|x_n - y_n|} (n f(z))^{\frac{2p(1-\alpha)-1}{d}} \right) + (1 - P(H_n)), \]

for sufficiently large $n$ since both $P(F_n)$ and $P(G_n)$ converge to one as $n \to \infty$.
Therefore



\[ P\left\{ \frac{|L_n - M_n|}{(n f(z))^{(1-p)/d} |x_n - y_n|} > u \right\} \le 6 \exp\left( -C_2 |x_n - y_n| (n f(z))^{\alpha} u^2 \right) + h_n, \tag{9} \]

where $h_n = (1 - P(F_n)) + (1 - P(G_n)) + (1 - P(H_n))$.

Now we show that the median $M_n$ and the mean $E L_n$ are close. By Jensen's
inequality, $|E L_n - M_n| \le E|L_n - M_n| = \int_0^\infty P\{|L_n - M_n| > u\}\, du$. Note that
$P\{|L_n - M_n| > u\} = 0$ when $u > |x_n - y_n|^p$ since $L_n = L_n(x_n, y_n)$ is bounded
above by $|x_n - y_n|^p$. Integrate the first term on the right side of (9) for $u > 0$, and
integrate the second term from $u = 0$ to $(n f(z))^{(p-1)/d} |x_n - y_n|^{p-1}$; we obtain an
upper bound

\[ \frac{|E L_n - M_n|}{(n f(z))^{(1-p)/d} |x_n - y_n|} \le 3 \sqrt{\frac{\pi}{C_2 |x_n - y_n| (n f(z))^{\alpha}}} + (n f(z))^{(p-1)/d} |x_n - y_n|^{p-1} h_n. \]

The probabilities of the events $F_n$, $G_n$, and $H_n$ approach one exponentially fast in
$(n f(z))^{\alpha} |x_n - y_n|$, therefore

\[ \lim_{n \to \infty} \frac{|E L_n - M_n|}{(n f(z))^{(1-p)/d} |x_n - y_n|} = 0. \]

By Proposition 5, for sufficiently large $n$,

\[ \left| \frac{L_n}{(n f(z))^{(1-p)/d} |x_n - y_n|} - C(d,p) \right| \le \frac{|L_n - M_n|}{(n f(z))^{(1-p)/d} |x_n - y_n|} + \frac{\varepsilon}{2}. \]

Thus the claim follows from (9). □

Next we show that Proposition 6 can be extended to a non-uniform probability
density $f$.

Theorem 7. Let $z \in \mathbb{R}^d$ and $R > 0$. Assume that the pdf $f$ is uniform in $B(z; R)$
but may have probability mass outside of $B(z; R)$.

Let $0 < b < 1$ and $c > 0$ be constants and let $\alpha = (d + 2p)^{-1}$. Fix $\varepsilon > 0$ and
denote by $E_n$ the event that

\[ \sup_{x,y} \left| \frac{L_n(x,y)}{(n f(z))^{(1-p)/d} |x - y|} - C(d,p) \right| < \varepsilon, \tag{10} \]

where the supremum is taken over $x, y \in B(z; R/4)$ and $(n f(z))^{b\alpha} |x - y| \ge c$. Then

\[ \limsup_{n \to \infty} \frac{1}{(n f(z))^{(1-b)\alpha}} \log(1 - P(E_n \cap F_n)) < 0, \tag{11} \]

where $F_n$ is the event defined in Lemma 3.

The condition $(n f(z))^{b\alpha} |x - y| \ge c > 0$ is introduced to guarantee that $n^{\alpha} \inf |x - y|$
has polynomial order and to prevent sub-polynomial, e.g., logarithmic, growth.

Proof. Let $L(x, r; \mathcal{X}_n)$, $r > 0$, denote the minimal power-weighted path length over
all the shortest paths from $x$ to all boundary points of $B(x; r)$, i.e.,
$L(x, r; \mathcal{X}_n) = \inf L_n(x, u)$ over all $u \in \mathbb{R}^d$ with $|x - u| = r$. For $x, r$ satisfying
$B(x; r) \subset B(z; R)$, we claim that

\[ \limsup_{n \to \infty} \frac{1}{r (n f(z))^{\alpha}} \log P\left\{ \left| \frac{L(x, r; \mathcal{X}_n)}{r (n f(z))^{(1-p)/d}} - C(d,p) \right| > \varepsilon \right\} < 0. \tag{12} \]



To establish (12), first note that the boundary of $B(x; r)$ may be covered with
open balls of radii $(n f(z))^{(\alpha-1)/d}$, and the number of cover elements may be chosen
less than $(2 n f(z) r)^d$. Let $v_1, v_2, \ldots, v_m$ be the centers of the cover elements. If the
event $F_n$ in Lemma 3 occurs and

\[ \left| \frac{L_n(x, v_k)}{r (n f(z))^{(1-p)/d}} - C(d,p) \right| < \frac{\varepsilon}{2} \]

for all $k = 1, \ldots, m$, then $|(n f(z))^{(p-1)/d} r^{-1} L_n(x, u) - C(d,p)| < \varepsilon$ for all $u$ on the
boundary of $B(x; r)$ for sufficiently large $n$. Note that
$L(x, r; \mathcal{X}_n) = L(x, r; \mathcal{X}_n \cap B(x; r))$, since the boundary of $B(x; r)$ disconnects the
interior and the exterior. If the shortest path to the boundary were to reach any point
outside $B(x; r)$, the path must have passed through the boundary, which is a contradiction.
An application of Proposition 6 to $L(x, r; \mathcal{X}_n \cap B(x; r))$ and an application of the
Chernoff bound (Billingsley 1995, Theorem 9.3) proves the inequality (12).

Let $x_n, y_n \in B(z; R/4)$ be two points satisfying $(n f(z))^{b\alpha} |x_n - y_n| \ge c$. Let $H_n$
denote the event that

\[ \frac{L_n(x_n, y_n)}{(n f(z))^{(1-p)/d} |x_n - y_n|} < C(d,p) + \varepsilon, \]

and let $K_n$ denote the event that

\[ \frac{L_n(x_n, y_n)}{(n f(z))^{(1-p)/d} |x_n - y_n|} > C(d,p) - \varepsilon. \]

We claim that

\[ \limsup_{n \to \infty} \frac{1}{(n f(z))^{(1-b)\alpha}} \log(1 - P(H_n \cap K_n)) < 0. \tag{13} \]

We break the proof of claim (13) into two parts dealing with $H_n$ and $K_n$
separately. The part for $H_n$ is simple. The inequality
$L_n(x_n, y_n) = L(x_n, y_n; \mathcal{X}_n) \le L(x_n, y_n; \mathcal{X}_n \cap B(z; R))$
and an application of Proposition 6 with the Chernoff bound
show that $1 - P(H_n)$ has an exponential decay in $(n f(z))^{\alpha} |x_n - y_n| \ge c\, (n f(z))^{(1-b)\alpha}$.

For the $K_n$ part, let $A_n$ denote the event that
$L(x_n, y_n; \mathcal{X}_n) = L(x_n, y_n; \mathcal{X}_n \cap B(z; R))$.
Suppose that $A_n$ did not occur, i.e., the shortest path attaining $L(x_n, y_n; \mathcal{X}_n)$ contains
a node outside $B(z; R)$, hence outside $B(x_n; 5R/8)$. The reader may find Figure 1 helpful.
It follows that $L(x_n, 5R/8; \mathcal{X}_n) \le L(x_n, y_n; \mathcal{X}_n)$, and

\[ 1 - P(K_n) = P(K_n^c \cap A_n) + P(K_n^c \cap A_n^c) \le P\left\{ \frac{L(x_n, y_n; \mathcal{X}_n \cap B(z; R))}{(n f(z))^{(1-p)/d} |x_n - y_n|} \le C(d,p) - \varepsilon \right\} + P\left\{ \frac{L(x_n, 5R/8; \mathcal{X}_n)}{(n f(z))^{(1-p)/d} |x_n - y_n|} \le C(d,p) - \varepsilon \right\}. \]

Apply Proposition 6 with the Chernoff bound to the first term on the right side
of the above inequality, and apply (12) to the second term. Therefore $1 - P(K_n)$
decays exponentially in $(n f(z))^{\alpha} |x_n - y_n| \ge c\, (n f(z))^{(1-b)\alpha}$. We have shown that
claim (13) holds.

Figure 1: When $x_n \in B(z; R/4)$, then $B(z; R/4) \subset B(x_n; 5R/8) \subset B(z; R)$. When
$y_n \in B(z; R/4)$, $L_n(x_n, y_n)$ is conditionally independent of the outside of $B(z; R)$ on
the event $L_n(x_n, y_n) < L(x_n, 5R/8; \mathcal{X}_n)$, due to the annulus buffer region
$\{u : R/4 < |z - u| < R\}$.

Now we are ready to prove the theorem. Suppose that the event $F_n$ occurs,
and let $\{B(w_i; (n f(z))^{(\alpha-1)/d}) : w_i \in B(z; R/4), 1 \le i \le m\}$ be an open cover of
$B(z; R/4)$ with $m \le (n f(z) R)^d$. Then for every $x, y \in B(z; R/4)$ there exist $w_i, w_j$
such that $|x - w_i| < (n f(z))^{(\alpha-1)/d}$ and $|y - w_j| < (n f(z))^{(\alpha-1)/d}$, hence

\[ \big| |x - y| - |w_i - w_j| \big| < 2 (n f(z))^{(\alpha-1)/d} \]

and

\[ |L_n(x, y) - L_n(w_i, w_j)| < 2^{p+1} (n f(z))^{p(\alpha-1)/d}. \]

Since $|x - y| \ge c\, (n f(z))^{-\alpha b}$ and $F_n$ is assumed to occur, an algebraic manipulation
shows that if $n$ is sufficiently large,

\[ \left| \frac{L_n(x, y)}{|x - y| (n f(z))^{(1-p)/d}} - \frac{L_n(w_i, w_j)}{|w_i - w_j| (n f(z))^{(1-p)/d}} \right| < \frac{\varepsilon}{2}, \]

hence for $n$ sufficiently large,

\[ P\left\{ \sup_{x,y} \left| \frac{L_n(x, y)}{(n f(z))^{(1-p)/d} |x - y|} - C(d,p) \right| > \varepsilon \right\} \le (1 - P(F_n)) + \sum_{i,j} P\left\{ \left| \frac{L_n(w_i, w_j)}{(n f(z))^{(1-p)/d} |w_i - w_j|} - C(d,p) \right| > \frac{\varepsilon}{2} \right\}, \]

where the sum is over $i$ and $j$ such that $|w_i - w_j| \ge c\, (n f(z))^{-\alpha b} - 2 (n f(z))^{(\alpha-1)/d}$. The
theorem now follows from (13) and Lemma 3 as $m \le (n f(z) R)^d$ is of polynomial
order in $n f(z)$. □

Corollary 8. Let $f_m, f_M$ be constants such that $0 < f_m \le f(u) \le f_M < \infty$ for all
$u \in B(z; R)$. Let $0 < b < 1$ and $c > 0$ be constants and let $\alpha = (d + 2p)^{-1}$. Fix
$\varepsilon > 0$. Let $H_n$ denote the event that

\[ \sup_{x,y} \frac{L_n(x, y)}{(n f_m)^{(1-p)/d} |x - y|} < C(d,p) + \varepsilon, \tag{14} \]

where the supremum is taken over $x, y \in B(z; R/4)$ and $(n f_m)^{b\alpha} |x - y| \ge c$. Then

\[ \limsup_{n \to \infty} \frac{1}{(n f_m)^{(1-b)\alpha}} \log(1 - P(H_n \cap F_n)) < 0, \tag{15} \]

where $F_n$ is the event defined in Lemma 3.

Similarly, let $K_n$ denote the event that

\[ \inf_{x,y} \frac{L(x, y; \mathcal{X}_n)}{(n f_M)^{(1-p)/d} |x - y|} > C(d,p) - \varepsilon, \]

where the infimum is taken over $x, y \in B(z; R/4)$ and $(n f_M)^{b\alpha} |x - y| \ge c$. Assume
that $\int_{B(z;R)} f_M\, du < 1$. Then

\[ \limsup_{n \to \infty} \frac{1}{(n f_M)^{(1-b)\alpha}} \log(1 - P(K_n \cap F_n)) < 0. \tag{16} \]

Proof. Consider the claim (15) involving $H_n$ first. For each point $X_i \in \mathcal{X}_n$, if $X_i$
is in $B(z; R)$ then discard $X_i$ with probability $1 - f_m / f(X_i)$. Let $\tilde{\mathcal{X}}_n$ denote the
filtered collection from $\mathcal{X}_n$. Then, obviously,
$L_n(x, y) = L(x, y; \mathcal{X}_n) \le L(x, y; \tilde{\mathcal{X}}_n)$
for all $x, y \in \mathbb{R}^d$, and if $L(x, y; \tilde{\mathcal{X}}_n) \le (C(d,p) + \varepsilon) (n f_m)^{(1-p)/d} |x - y|$ holds, then (14)
holds. Note that $\tilde{\mathcal{X}}_n$ is an i.i.d. sample with uniform density $f_m$ when restricted to
$B(z; R)$. Apply Theorem 7 with the Chernoff bound, and we obtain (15).

The claim (16) involving $K_n$ follows from a similar argument. □
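
The filtering step in the proof is an independent thinning of the sample. A minimal Python sketch follows, assuming the density `f`, the lower bound `f_m` and the ball membership test `in_ball` are supplied by the caller; the function name `thin_sample` is hypothetical. Because the filtered collection is a subset of the original one, the shortest path through it can only be longer, which is the monotonicity property used in the proof.

```python
import random


def thin_sample(points, f, f_m, in_ball, rng=random):
    """Discard each point x inside the ball independently with probability
    1 - f_m / f(x); points outside the ball are always kept."""
    kept = []
    for x in points:
        # Inside the ball a point survives with probability f_m / f(x).
        if in_ball(x) and rng.random() >= f_m / f(x):
            continue
        kept.append(x)
    return kept
```

When $f \equiv f_m$ on the ball nothing is discarded, since the survival probability is one.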

3.2 Convergence in Riemannian manifolds 

We adapt Corollary 8 to the case when the probability distribution is supported on 
a Riemannian manifold M instead of on a Euclidean space. 

Lemma 9. Let $(M, g_1)$ be a Riemannian manifold equipped with metric $g_1$. Let
$b \in (0, 1)$, $c > 0$, $\varepsilon > 0$ be fixed constants. Let $\alpha = (d + 2p)^{-1}$. Assume that
$f(z) > 0$ and that $f$ is smooth at $z$, i.e., $f$ is infinitely differentiable at $z$. We
denote by $E_n = E_n(z, R)$, for $z \in M$ and $R > 0$, the event that

(i) every shortest path link in $B(z; R)$ has length less than $(n f(z))^{(\alpha-1)/d}$, and

(ii) the supremum

\[ \sup_{x,y} \left| \frac{L(x, y; \mathcal{X}_n)}{n^{(1-p)/d}\, \mathrm{dist}_p(x, y)} - C(d,p) \right| < \varepsilon, \tag{17} \]

for $x, y \in M$ such that $\mathrm{dist}_1(x, z) < R$, $\mathrm{dist}_1(y, z) < R$, and
$\mathrm{dist}_1(x, y) \ge c\, (n f(z))^{-\alpha b}$.

For every $z \in M$, there exists $R > 0$ such that

\[ \limsup_{n \to \infty} (n f(z))^{\alpha(b-1)} \log(1 - P(E_n)) < 0. \]

Proof. The event condition (i) may be easily proved with the same arguments from
Lemma 3. We concentrate on condition (ii).

Define $U = B(z; 4R) = \{u \in M : \mathrm{dist}_1(u, z) < 4R\}$ for $R > 0$. For every
$\delta > 0$ we may choose $R > 0$ small enough so that there exists a normal chart map

\[ \varphi : U \subset M \to V \subset \mathbb{R}^d \]

satisfying

\[ (1 - \delta)^d \sup_U f \le f(z) \le (1 + \delta)^d \inf_U f, \tag{18} \]

and

\[ 1 - \delta \le \frac{\mathrm{dist}_1(u, v)}{|\varphi(u) - \varphi(v)|} \le 1 + \delta, \tag{19} \]

for all $u \ne v \in U$. Note that the denominator in (19) is a Euclidean distance.

Let $x, y \in B(z; R) \subset U$. Let $L(\varphi(x), \varphi(y); \varphi(\mathcal{X}_n))$ denote the shortest path
length between $\varphi(x), \varphi(y) \in V$ in Euclidean space $\mathbb{R}^d$. Suppose that

holds. Then by the assumptions (18) and (19),

\[ (C(d,p) - \varepsilon) (\inf_U f)^{(1-p)/d} \le \frac{L_n(x, y)}{n^{(1-p)/d}\, \mathrm{dist}_1(x, y)} \le (C(d,p) + \varepsilon) (\sup_U f)^{(1-p)/d}, \tag{21} \]

when $\delta$ is sufficiently small. We next show that (17) follows from (21).

Since $x, y \in B(z; R)$, the minimal $g_1$-geodesic curve from $x$ to $y$ is contained in
$U$ by assumption (19). It follows from the definition of Riemannian distance that

\[ \mathrm{dist}_p(x, y) \le \mathrm{dist}_1(x, y) (\inf_U f)^{(1-p)/d}. \tag{22} \]

Furthermore, if the minimal $g_p$-geodesic curve from $x$ to $y$ were contained in $U$, then

\[ \mathrm{dist}_p(x, y) \ge \mathrm{dist}_1(x, y) (\sup_U f)^{(1-p)/d}. \tag{23} \]

If a (piecewise) smooth curve from $x$ exits $U$, then the curve length under
$g_p$ must be at least $(3R) (\sup_U f)^{(1-p)/d}$ by the assumptions $\mathrm{dist}_1(x, z) < R$ and
$U = B(z; 4R)$. Therefore it follows from (18) that (23) holds since

\[ \mathrm{dist}_p(x, y) \le \mathrm{dist}_1(x, y) (\inf_U f)^{(1-p)/d} \le (2R) (\sup_U f)^{(1-p)/d} \left( \frac{(1 - \delta)^d}{(1 + \delta)^d} \right)^{(1-p)/d} < (3R) (\sup_U f)^{(1-p)/d}, \]

if $\delta$ were sufficiently small, and the minimal $g_p$-geodesic must be contained in $U$.
After we combine (21), (22), and (23),

\[ (C(d,p) - \varepsilon)\, \mathrm{dist}_p(x, y) \le n^{(p-1)/d} L_n(x, y) \le (C(d,p) + \varepsilon)\, \mathrm{dist}_p(x, y) \]

holds provided that (20) is true. Apply Corollary 8 to (20), and the lemma is
proved. □






Figure 2: Path division procedure described in the proof of Theorem 1. Here $k = 4$.
Note that $z_i \in U_i$ and $z_{i+1} \in V_i$ for $i = 1, 2, 3$. The shortest path is depicted as a smooth
curve for illustration purposes only; it is actually piecewise smooth.

Our main result, Theorem 1, can now be obtained by applying Lemma 9 to a
finite open cover of the manifold.

Proof of Theorem 1. The crux of the proof is that the shortest path length has near 
sub- and super-additivity with high probability. We will show that if Lemma 9 holds 
in open cover elements, then the local convergences may be assembled together to 
yield global convergence of the curve length. 

For each $w_i \in M$, we may associate a positive $R_i > 0$ such that Lemma 9
holds within the region $V_i = \{v \in M : \mathrm{dist}_1(v, w_i) < 3R_i\}$. Let
$U_i = \{u \in M : \mathrm{dist}_1(u, w_i) < R_i\}$. Since $M$ is assumed to be compact, we may choose
finite $m > 0$, $\{w_i \in M\}_{i=1}^{m}$, and corresponding $\{R_i > 0\}_{i=1}^{m}$ such that the corresponding
$\{U_i\}$ is a finite open cover of $M$.

Reorder the indices if necessary so that $x \in U_1$. Define $z_1 = x$. If $L_n(x, y)$ ever
exits $V_1$, then a point $z_2 \in V_1$ on the shortest path may be chosen such that $z_2 \notin U_1$.
Then $\mathrm{dist}_1(z_1, z_2) \ge R_1$. Note that $z_2 \in M$ need not be in $\mathcal{X}_n$. Reorder the indices
of the open cover again if necessary so that $z_2$ is in $U_2$. Repeat the procedure until
$L_n(x, y)$ ends in, say, $V_k$. Set $z_{k+1} = y$. Then the points $x = z_1, z_2, \ldots, z_k, z_{k+1} = y$
satisfy the conditions $z_i, z_{i+1} \in V_i$ for $i = 1, 2, \ldots, k$, and $\mathrm{dist}_1(z_i, z_{i+1}) \ge R_i \ge R$
for $i = 1, 2, \ldots, k - 1$, where $R = \min_i R_i$. The last edge length $\mathrm{dist}_1(z_k, z_{k+1})$
may be less than $R$. However, note that $z_{k-1} \in U_{k-1}$ and $y = z_{k+1} \notin V_{k-1}$ by
definition, hence $\mathrm{dist}_1(z_{k-1}, z_{k+1}) \ge 2R_{k-1} \ge 2R$. Therefore $z_k$ may be adjusted so
that $\mathrm{dist}_1(z_k, z_{k+1}) \ge R$ as well, and it is easily checked that $z_k$ is still in $V_k$. See
Figure 2 for illustration.

Suppose that

\[ (C(d,p) - \varepsilon)\, \mathrm{dist}_p(z_i, z_{i+1}) \le n^{(p-1)/d} L_n(z_i, z_{i+1}) \tag{24} \]

holds for all $i = 1, 2, \ldots, k$. Then by the triangle inequality and the property
$\nu^p + \omega^p \le (\nu + \omega)^p$ for $\nu, \omega \ge 0$,

\[ (C(d,p) - \varepsilon)\, \mathrm{dist}_p(x, y) \le (C(d,p) - \varepsilon) \sum_{i=1}^{k} \mathrm{dist}_p(z_i, z_{i+1}) \le \sum_{i=1}^{k} n^{(p-1)/d} L_n(z_i, z_{i+1}) \le n^{(p-1)/d} L_n(x, y). \]

Since $m$ is finite and Lemma 9 applies in $V_1, V_2, \ldots, V_m$,

\[ \limsup_{n \to \infty} (n \inf f)^{(b-1)/(d+2p)} \log P\left\{ \inf_{x,y} \frac{L_n(x, y)}{n^{(1-p)/d}\, \mathrm{dist}_p(x, y)} < C(d,p) - \varepsilon \right\} < 0. \]

For the upper tail, we follow a similar strategy to Bernstein et al. (2000). If
$z_1 = x$, $z_{k+1} = y$, and $z_i$ are points on the minimal geodesic curve from $x$ to $y$ under
$g_p$, then $\mathrm{dist}_p(x, y) = \sum_{i=1}^{k} \mathrm{dist}_p(z_i, z_{i+1})$. We showed above that the points may
be chosen and indices of the open cover may be rearranged such that $z_i, z_{i+1} \in V_i$
and $\mathrm{dist}_1(z_i, z_{i+1}) \ge R$ for all $i = 1, 2, \ldots, k$. Since Lemma 9 applies in $V_1, \ldots, V_m$,
every edge length of the shortest path from $z_i$ to $z_{i+1}$ is at most $(n \sup f)^{(\alpha-1)/d}$ for
$i = 1, \ldots, k$, where $\alpha = (d + 2p)^{-1}$ from Lemma 9. Therefore each paste procedure
may incur an additional cost of at most $2^p (n \sup f)^{p(\alpha-1)/d}$ so that

\[ L_n(x, y) \le \sum_{i=1}^{k} L_n(z_i, z_{i+1}) + k 2^p (n \sup f)^{p(\alpha-1)/d}, \tag{25} \]

where $\alpha = (d + 2p)^{-1}$. Therefore if Lemma 9 holds in $V_1, V_2, \ldots, V_m$, then

\[ n^{(p-1)/d} L_n(x, y) \le \Big( C(d,p) + \frac{\varepsilon}{2} \Big) \mathrm{dist}_p(x, y) + k 2^p n^{(p-1)/d} (n \sup f)^{(\alpha-1)p/d}, \]

and if $n$ is large enough, $n^{(p-1)/d} L_n(x, y) \le (C(d,p) + \varepsilon)\, \mathrm{dist}_p(x, y)$ since $n^{(\alpha p - 1)/d} n^{\alpha b}$
shrinks to zero as $n \to \infty$. Therefore Theorem 1 is established by applications of
Lemma 9 to $V_1, V_2, \ldots, V_m$. □



4 Mean convergence and node cardinality 

In this section, we prove Lemma 4 and Proposition 5. Since they were stated for
sequences $\mathcal{X}_n$ in a Euclidean space, we return to the Euclidean case $M = \mathbb{R}^d$. We
introduce some additional notation used in this section.

As we mentioned, the proofs require some results on Poisson processes. We
denote by $\mathcal{H}_\lambda$ a homogeneous Poisson point process in $\mathbb{R}^d$ of constant intensity
$\lambda > 0$. Specifically, for any Borel set $B$ of Lebesgue measure $\nu(B)$ the cardinality
$N_B$ of $\mathcal{H}_\lambda \cap B$ is a Poisson random variable with mean $\lambda \nu(B)$ and, conditioned on
$N_B$, the points $\mathcal{H}_\lambda \cap B$ are uniform i.i.d. over $B$. As in the i.i.d. case, we use the
shorthand notation $\mathcal{L}_\lambda(x, y) = L(x, y; \mathcal{H}_\lambda)$ for $x, y \in \mathbb{R}^d$.
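
The two-step description of the Poisson process (a Poisson count, then uniform i.i.d. locations) translates directly into a sampler. The Python sketch below assumes the Borel set is the cube $[0, \text{side}]^d$ and draws the count by accumulating unit-rate exponential gaps; the function names are hypothetical and only the standard library is used.

```python
import random


def sample_poisson_count(mean, rng):
    """Draw a Poisson(mean) variate by counting unit-rate arrivals before time `mean`."""
    count = 0
    t = rng.expovariate(1.0)
    while t < mean:
        count += 1
        t += rng.expovariate(1.0)
    return count


def sample_poisson_process(lam, side, d, rng=None):
    """Sample a homogeneous Poisson process of intensity `lam` restricted to [0, side]^d."""
    rng = rng or random.Random()
    # The number of points is Poisson with mean lam times the volume of the cube.
    n = sample_poisson_count(lam * side ** d, rng)
    # Conditioned on the count, the points are i.i.d. uniform over the cube.
    return [tuple(rng.uniform(0.0, side) for _ in range(d)) for _ in range(n)]
```

The running time of the count sampler grows linearly with the mean, which is adequate for illustration but not for very large intensities.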

Let $e_1 = (1, 0, \ldots, 0) \in \mathbb{R}^d$ be a fixed unit vector. By the translation and
rotation invariance of $\mathcal{H}_\lambda$, the distribution of $\mathcal{L}_\lambda(x, y)$ for $x, y \in \mathbb{R}^d$ is the same as
the distribution of $\mathcal{L}_\lambda(0, t e_1)$ where $t = |x - y|$. This observation is used frequently
in this section.

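Let $T(u, v; b)$ for $u, v \in \mathbb{R}^d$, $b > 0$, denote the set

\[ T(u, v; b) = \bigcup_{0 \le s \le 1} B(su + (1 - s)v; b). \tag{26} \]

Note that $\bigcup_{b > 0} T(u, v; b) = \mathbb{R}^d$. For convenience, define

\[ \mathcal{L}_\lambda(u, v; b) = L(u, v; \mathcal{H}_\lambda \cap T(u, v; b)). \tag{27} \]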
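
The set in (26) is the open tube of radius $b$ around the segment from $u$ to $v$, so membership reduces to a point-to-segment distance test. A minimal Python sketch, with the hypothetical name `in_tube`, can be used to restrict a point set to the tube before computing the path length in (27).

```python
import math


def in_tube(w, u, v, b):
    """True if w lies in the union of open balls of radius b centered on the
    segment between u and v."""
    seg = [vi - ui for ui, vi in zip(u, v)]
    seg_len2 = sum(c * c for c in seg)
    if seg_len2 == 0.0:
        return math.dist(w, u) < b
    # Project w onto the line through u and v, then clamp to the segment.
    s = sum((wi - ui) * c for wi, ui, c in zip(w, u, seg)) / seg_len2
    s = min(1.0, max(0.0, s))
    closest = tuple(ui + s * c for ui, c in zip(u, seg))
    return math.dist(w, closest) < b
```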



4.1 Percolation lemma 

The following lemma on percolation will be used in the proof of Lemma 4. 

Lemma 10. Let $\pi$ be a graph path in $\mathcal{H}_\lambda$ starting at $0 \in \mathbb{R}^d$. Suppose that $\pi$ has
power-weighted path length at most $c_0 \lambda^{(1-p)/d}$ and has at least $c_1 \lambda^{1/d}$ nodes for some
$c_0, c_1 > 0$. Then there exists a constant $\rho_0 > 0$, dependent on $d$ and $p$, such that
if $c_1 > \rho_0 c_0$ then the probability that such a path $\pi$ exists is exponentially small in
Proof. The strategy of this proof is similar to that of Meester and Roy (1996,
Theorem 6.1). We first define a Galton-Watson process $X_n$. Let $X_0 = \{x_0 = 0 \in \mathbb{R}^d\}$
be the ancestor of the family, and associate with it the parameter $r_0 > 0$. Then define
the offspring $X_1(r_0)$ to be $\mathcal{H}_\lambda \cap B(x_0; r_0^{1/p})$. Thus $X_1(r_0)$ is the set of points in $\mathcal{H}_\lambda$ that
may be reached from $x_0$ with exactly one edge of power-weighted length at most $r_0$.
Note that $\mathbb{E}|X_1(r_0)| = \lambda V_d r_0^{d/p}$, where $|X_1(r_0)|$ denotes the
cardinality of $X_1(r_0)$, and $V_d$ denotes the volume of $B(0; 1)$.

For each offspring $x_{1,k} \in X_1(r_0)$, we associate the parameter $r_{1,k} = r_0 - |x_{1,k} - x_0|^p$.
Then the set of points of $\mathcal{H}_\lambda$ in the union of $B(x_{1,k}; r_{1,k}^{1/p}) \setminus \{x_{1,k}\}$ over $k$ is the set of points that
may be reached from $x_0$ with exactly two edges, while the power-weighted path
length is at most $r_0$. Define $X_2(r_0)$ to be the collection of all the second generation
offspring, and define recursively the $n$-th generation offspring $X_n(r_0)$. Then $X_n(r_0)$
is the set of all the points that may be reached in $n$ hops from the ancestor $x_0$ within
path length $r_0$. See Figure 3. We prove by induction that

$$\mathbb{E}|X_n(r_0)| \le \frac{\left(\lambda V_d\, \Gamma(1 + d/p)\, r_0^{d/p}\right)^n}{\Gamma(1 + nd/p)}. \tag{28}$$






Figure 3: A run through the family tree generated by $X_n$ with $p = 2$. The point $x_0$
is the ancestor with parameter $r_0 = 9$. This means that all the runs through the
family tree are paths with power-weighted length less than $r_0^{1/p} = 3$. Here $x_{1,1} \in X_1$
is among the first generation since it is within $B(x_0; r_0^{1/p})$, and $x_{2,1} \in X_2$ is among
the second generation since it is within the balls centered at the first generation
offspring, e.g., $x_{1,1}$. This particular run ends at $x_{4,1}$ as there is no point in the
vicinity. In this example, the power-weighted path length is $\sqrt{1^2 + 2^2 + 1.5^2 + 1^2} =
\sqrt{8.25} < 3$. Note that $x_{2,1}$ is also in the ball centered at $x_0$, so it is also a first
generation offspring. Some other runs through the family tree will have the point
$x_{2,1}$ as a first generation offspring.
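For a finite point set, the generation sets can be computed by a hop-by-hop dynamic programme: a point belongs to the $n$-th generation exactly when some $n$-hop walk from the ancestor reaches it with total power-weighted cost at most $r_0$. The sketch below is one possible way to compute them; the function name and the collinear test configuration (gaps $1, 2, 1.5, 1$, chosen to mimic the run in Figure 3) are assumptions for illustration.

```python
import math

def generation_sets(ancestor, points, r0, p, max_generation):
    """Return [X_1, ..., X_n] as sets of indices into `points`.

    X_k holds the points reachable from the ancestor in exactly k hops with
    total power-weighted length (sum of |edge|^p) at most r0.
    """
    # cost[j] = cheapest k-hop walk from the ancestor that ends at points[j]
    cost = [math.dist(ancestor, q) ** p for q in points]
    generations = [{j for j, c in enumerate(cost) if c <= r0}]
    for _ in range(max_generation - 1):
        nxt = [math.inf] * len(points)
        for i, ci in enumerate(cost):
            if ci > r0:
                continue  # no budget left at points[i], so it has no offspring
            for j, q in enumerate(points):
                if j != i:
                    nxt[j] = min(nxt[j], ci + math.dist(points[i], q) ** p)
        cost = nxt
        generations.append({j for j, c in enumerate(cost) if c <= r0})
    return generations

# Collinear points at 1, 3, 4.5, 5.5 give consecutive gaps 1, 2, 1.5, 1.
pts = [(1.0, 0.0), (3.0, 0.0), (4.5, 0.0), (5.5, 0.0)]
gens = generation_sets((0.0, 0.0), pts, r0=9.0, p=2, max_generation=4)
```

Here the last point is reached only in the fourth generation, with cost $1 + 4 + 2.25 + 1 = 8.25 \le 9$; every three-hop walk to it costs more than $9$.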

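We mentioned above that $\mathbb{E}|X_1(r_0)| = \lambda V_d r_0^{d/p}$, so (28) is true for $n = 1$. For
general $n$, apply the Campbell-Mecke formula (Baddeley 2007, Theorem 3.2, p. 48):

$$\begin{aligned}
\mathbb{E}|X_n(r_0)| &\le \lambda \int_{B(x_0;\, r_0^{1/p})} \mathbb{E}\left|X_{n-1}\left(r_0 - |x - x_0|^p\right)\right| dx \\
&\le \frac{\lambda \left(\lambda V_d\, \Gamma(1 + d/p)\right)^{n-1}}{\Gamma(1 + (n-1)d/p)} \int_{B(x_0;\, r_0^{1/p})} \left(r_0 - |x - x_0|^p\right)^{(n-1)d/p} dx \\
&= \frac{\left(\lambda V_d\, \Gamma(1 + d/p)\, r_0^{d/p}\right)^n}{\Gamma(1 + nd/p)}.
\end{aligned}$$

We have established (28).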
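The last equality in the induction step rests on a Beta-integral identity: in polar coordinates, the integral of $(r_0 - |x|^p)^{(n-1)d/p}$ over $B(0; r_0^{1/p})$ equals $V_d\,\Gamma(1+d/p)\,\Gamma(1+(n-1)d/p)\, r_0^{nd/p} / \Gamma(1+nd/p)$. The following numerical check is a sketch under that reading; the midpoint rule and the function names are choices made here, not taken from the paper.

```python
import math

def unit_ball_volume(d):
    """Volume V_d of the unit ball in R^d."""
    return math.pi ** (d / 2) / math.gamma(1 + d / 2)

def radial_integral(r0, n, d, p, steps=20000):
    """Midpoint-rule value of the integral of (r0 - |x|^p)^((n-1)d/p) over B(0; r0^(1/p))."""
    radius = r0 ** (1 / p)
    h = radius / steps
    total = 0.0
    for i in range(steps):
        rho = (i + 0.5) * h
        # d * V_d * rho^(d-1) is the surface area of the sphere of radius rho
        total += d * unit_ball_volume(d) * rho ** (d - 1) * (r0 - rho ** p) ** ((n - 1) * d / p)
    return total * h

def closed_form(r0, n, d, p):
    """V_d * Gamma(1+d/p) * Gamma(1+(n-1)d/p) / Gamma(1+nd/p) * r0^(nd/p)."""
    return (unit_ball_volume(d) * math.gamma(1 + d / p) * math.gamma(1 + (n - 1) * d / p)
            / math.gamma(1 + n * d / p) * r0 ** (n * d / p))
```

For example, with $d = p = 2$, $n = 3$ and $r_0 = 1$ both expressions equal $\pi/3$.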

Using the Markov inequality and Stirling's approximation,

$$\log \mathbb{P}\{X_n(r_0) \ne \emptyset\} \le n \log\left( \lambda V_d\, \Gamma\left(1 + \frac{d}{p}\right) \left(\frac{e p\, r_0}{n d}\right)^{d/p} \right) + o(n)$$

as $n \to \infty$. Note that if a path starting at $x_0$ passes through more than $n = c_1\lambda^{1/d}$
nodes and has path length less than $r_0 = c_0\lambda^{(1-p)/d}$, then the $n$-th generation set
$X_n(r_0)$ will not be empty. Lemma 10 follows since, if the ratio $c_1/c_0$ is sufficiently
large, the logarithm term above is negative. □
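Substituting $n = c_1\lambda^{1/d}$ and $r_0 = c_0\lambda^{(1-p)/d}$ into the bound (28) removes the dependence on $\lambda$ inside the logarithm, and Stirling's formula then suggests the explicit threshold $(pe/d)\,(V_d\,\Gamma(1+d/p))^{p/d}$ for the ratio $c_1/c_0$. This value is derived here only as an illustration and is not claimed to be the constant $\rho_0$ intended by the authors. The sketch evaluates the logarithm of the right side of (28) exactly with `math.lgamma`.

```python
import math

def unit_ball_volume(d):
    """Volume V_d of the unit ball in R^d."""
    return math.pi ** (d / 2) / math.gamma(1 + d / 2)

def log_expected_generation_bound(n, lam, r0, d, p):
    """Log of the right side of (28): n*log(lam*V_d*Gamma(1+d/p)*r0^(d/p)) - log Gamma(1+n*d/p)."""
    per_node = math.log(lam * unit_ball_volume(d) * math.gamma(1 + d / p)) + (d / p) * math.log(r0)
    return n * per_node - math.lgamma(1 + n * d / p)

def rho0_from_stirling(d, p):
    """Illustrative threshold (p*e/d) * (V_d*Gamma(1+d/p))^(p/d); an assumption of this sketch."""
    return (p * math.e / d) * (unit_ball_volume(d) * math.gamma(1 + d / p)) ** (p / d)

def log_bound_at_scale(lam, c0, c1, d, p):
    """Evaluate the log bound with n = c1*lam^(1/d) hops and budget r0 = c0*lam^((1-p)/d)."""
    n = c1 * lam ** (1 / d)
    r0 = c0 * lam ** ((1 - p) / d)
    return log_expected_generation_bound(n, lam, r0, d, p)
```

For $d = p = 2$ the threshold is $e\pi \approx 8.54$. With $c_1/c_0$ at twice this value the log bound is negative and decreases linearly in the number of hops, while at half this value it is positive for large $\lambda$, so the bound gives no information there.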

4.2 Mean convergence for Poisson point processes 

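Lemma 11. Consider the shortest path length $\mathcal{L}_1(0, te_1)$ from $0 \in \mathbb{R}^d$ to $te_1 \in \mathbb{R}^d$
in $\mathcal{H}_1$ for $t > 0$. Then

$$\lim_{t \to \infty} \frac{1}{t}\, \mathbb{E}\mathcal{L}_1(0, te_1) = C(d,p). \tag{29}$$

In addition, if $b = b_t$ is a function of $t$ satisfying $\liminf_t b_t = \infty$, then

$$\lim_{t \to \infty} \frac{1}{t}\, \mathbb{E}\mathcal{L}_1(0, te_1; b_t) = C(d,p). \tag{30}$$

Recall that $\mathcal{L}_1(0, te_1; b_t)$ denotes $L(0, te_1; \mathcal{H}_1 \cap T(0, te_1; b_t))$ from (26) and (27).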
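The normalized mean in (30) can be explored by simulation in the plane. The sketch below samples a unit-intensity Poisson process in the tube $T(0, te_1; b)$, computes the power-weighted shortest path with Dijkstra's algorithm, and averages $\mathcal{L}_1(0, te_1; b)/t$ over independent trials. All function names are assumptions of this sketch, $d = 2$ is fixed for brevity, and finite $t$, $b$ and trial counts give only a rough indication of the limiting constant.

```python
import heapq
import math
import random

def poisson_count(mean, rng):
    """N ~ Poisson(mean), by counting unit-rate exponential arrivals in [0, mean]."""
    count, elapsed = 0, rng.expovariate(1.0)
    while elapsed <= mean:
        count += 1
        elapsed += rng.expovariate(1.0)
    return count

def sample_tube(t, b, rng):
    """Unit-intensity Poisson points in T(0, t*e1; b) in the plane (d = 2)."""
    n = poisson_count((t + 2 * b) * (2 * b), rng)
    pts = [(rng.uniform(-b, t + b), rng.uniform(-b, b)) for _ in range(n)]
    # keep the points within distance b of the segment from (0, 0) to (t, 0)
    return [q for q in pts if math.dist(q, (min(max(q[0], 0.0), t), 0.0)) < b]

def shortest_power_path(t, pts, p):
    """Power-weighted shortest path length from (0, 0) to (t, 0) through pts."""
    nodes = [(0.0, 0.0), (t, 0.0)] + pts
    best = [math.inf] * len(nodes)
    best[0] = 0.0
    heap = [(0.0, 0)]
    while heap:
        cost, i = heapq.heappop(heap)
        if i == 1:
            return cost
        if cost > best[i]:
            continue
        for j, q in enumerate(nodes):
            c = cost + math.dist(nodes[i], q) ** p
            if c < best[j]:
                best[j] = c
                heapq.heappush(heap, (c, j))
    return best[1]

def mean_normalized_length(t, b, p, trials, seed=0):
    """Monte Carlo estimate of E L_1(0, t*e1; b) / t."""
    rng = random.Random(seed)
    total = sum(shortest_power_path(t, sample_tube(t, b, rng), p) for _ in range(trials))
    return total / (trials * t)
```

Since the direct edge always gives $\mathcal{L}_1(0, te_1; b) \le t^p$, every estimate lies in $(0, t^{p-1}]$. As a sanity check outside the range $p > 1$ of the paper, $p = 1$ returns exactly $1$ by the triangle inequality.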

When $b = +\infty$, (29) is a consequence of, e.g., Howard and Newman (2001,
Section 4). The main difference is the case when $b < +\infty$. Howard and Newman (2001,
Theorem 2.4) states that the probability that the event $\mathcal{L}_1(0, te_1) \ne \mathcal{L}_1(0, te_1; b_t)$
occurs is exponentially small of order at least $t^{3p\varepsilon/4}$ for some $\varepsilon > 0$ when $b_t \ge t^{3/4+\varepsilon}$.
Lemma 11 is weaker in the sense that it only asserts closeness in the mean. On
the other hand, Lemma 11 is stronger in the sense that the assumption on $b_t$ is
relaxed so that $b_t$ need only diverge to infinity, and the rate of growth may even be
sub-polynomial.

Proof of Lemma 11. Initially we let $b > 0$ be a constant instead of a function of $t$.
This assumption is removed later in the proof. Recall the definition of $\Theta(x,y)$ in
the proof of Lemma 3. Let

$$T(b) = \bigcup_{s > 0} T(-se_1, +se_1; b),$$

and let

$$\xi_t(\lambda, b) = \sup\{ |u - te_1| : u \in T(b),\ \mathcal{H}_\lambda \cap T(b) \cap \Theta(u, te_1) = \emptyset \}.$$

In other words, $\xi_t(\lambda, b)$ denotes the maximum distance of $u \in T(b)$ from $te_1$ such
that the shortest path from $te_1$ to $u$ is the direct path $te_1 \to u$. From the continuity
of the function $h$ which is used to define $\Theta(x,y)$ in the proof of Lemma 3, it is not
difficult to show that there exist constants $A, \delta > 0$ and constant integers $k, m > 0$,
all independent of $b$ and $\lambda$, such that for all $t \in \mathbb{R}$,

$$\mathbb{E}\,\xi_t(\lambda, b)^p \le \frac{k}{(\lambda A)^{p/d}} + \frac{m}{\lambda^p (\delta b)^{p(d-1)}}. \tag{31}$$

It is not surprising that the upper bound does not depend on $t$ since $\mathcal{H}_\lambda$ is
homogeneous. For a simple proof of this see Hwang (2012, Lemma 2.5, Equation 2.14).

Figure 4: An illustration of the path paste procedure. A new path from $0$ to $(s+t)e_1$
is created by removing $se_1 = \gamma_0$ and joining $\gamma_-$ and $\gamma_+$. Only the end points are
fixed points in the new path.

Let $s, t > 0$. Consider the shortest path for $\mathcal{L}_1(0, se_1; b)$ between $0$ and
$se_1$, and let $\gamma_-$ denote the node that directly connects to $se_1$. Similarly consider
the shortest path for $\mathcal{L}_1(se_1, (s+t)e_1; b)$ and let $\gamma_+$ denote the node that directly
connects to $se_1$. Therefore $\gamma_-$ and $\gamma_+$ are Poisson sample points incident to $se_1$.
For convenience let $\gamma_0 = se_1$. Remove $\gamma_0$ from the two paths, and join the nodes
$\gamma_-$ and $\gamma_+$ so that we have a new path connecting $0$ and $(s+t)e_1$, as indicated in
Figure 4. The length of this new path is an upper bound on $\mathcal{L}_1(0, (s+t)e_1; b)$, so

$$\mathcal{L}_1(0, (s+t)e_1; b) \le \mathcal{L}_1(0, se_1; b) + \mathcal{L}_1(se_1, (s+t)e_1; b)
+ (2^{p-1} - 1)\left(|\gamma_0 - \gamma_-|^p + |\gamma_+ - \gamma_0|^p\right),$$

by the convexity of the power function for $p > 1$. Note that both $|\gamma_0 - \gamma_-|$ and
$|\gamma_+ - \gamma_0|$ are bounded above by $\xi_s(1, b)$. Therefore $\mathbb{E}\mathcal{L}_1(0, (s+t)e_1; b)$ is bounded
above by

$$\begin{aligned}
&\mathbb{E}\mathcal{L}_1(0, se_1; b) + \mathbb{E}\mathcal{L}_1(se_1, (s+t)e_1; b) + \mathbb{E}(2\xi_s(1, b))^p \\
&\quad = \mathbb{E}\mathcal{L}_1(0, se_1; b) + \mathbb{E}\mathcal{L}_1(0, te_1; b) + \mathbb{E}(2\xi_0(1, b))^p.
\end{aligned}$$
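The convexity step in the paste argument can be spelled out as follows. This is a sketch of the reasoning rather than a quotation from the paper; the letters $a$ and $c$ are introduced here only as shorthand for the two removed edge lengths, and $\xi_s(1,b)$ denotes the bound on the lengths of the edges incident to $se_1$.

```latex
% a = |\gamma_0 - \gamma_-| and c = |\gamma_+ - \gamma_0| are the two removed edge lengths.
\begin{align*}
|\gamma_+ - \gamma_-|^p
  &\le (a + c)^p
     && \text{(triangle inequality)} \\
  &= 2^p \left( \tfrac{a + c}{2} \right)^p
   \le 2^{p-1} \left( a^p + c^p \right)
     && \text{(convexity of } u \mapsto u^p,\ p > 1\text{)} .
\end{align*}
% Replacing the two edges, of total cost a^p + c^p, by the single edge
% \gamma_- \to \gamma_+ therefore increases the path length by at most
\[
  |\gamma_+ - \gamma_-|^p - (a^p + c^p)
  \le (2^{p-1} - 1)(a^p + c^p)
  \le (2^{p} - 2)\, \xi_s(1,b)^p
  \le \bigl( 2\, \xi_s(1,b) \bigr)^p .
\]
```

The final inequality is what allows the correction term to be absorbed into $\mathbb{E}(2\xi_s(1,b))^p$.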

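The equality in the bound on $\mathbb{E}\mathcal{L}_1(0, (s+t)e_1; b)$ holds by the translation invariance of the distribution of
$\mathcal{H}_1$. Therefore $\mathbb{E}\mathcal{L}_1(0, te_1; b) + \mathbb{E}(2\xi_0(1, b))^p$ is a sub-additive function of $t$. Note
that $\mathcal{L}_1(0, te_1; b) \le t^p$, and $t \mapsto t^p$ is Lipschitz on compact intervals. Therefore
$\mathbb{E}\mathcal{L}_1(0, te_1; b)$ is continuous for $t > 0$. A standard proof of Fekete's lemma (for
example, see Steele 1997, Lemma 1.2.1) may be easily adapted to continuous
sub-additive functions, and the limit of $t^{-1}\mathbb{E}\mathcal{L}_1(0, te_1; b)$ exists. The limiting value is
given by

$$\lim_{t \to \infty} \frac{\mathbb{E}\mathcal{L}_1(0, te_1; b)}{t} = \inf_{t > 0} \frac{\mathbb{E}\mathcal{L}_1(0, te_1; b) + \mathbb{E}(2\xi_0(1, b))^p}{t}, \tag{32}$$

and we denote the limit by $C(d,p;b)$. In the special case $b = \infty$, $C(d,p;\infty) = C(d,p)$,
establishing (29).

We now show that $C(d,p;b)$ converges to $C(d,p;\infty) = C(d,p)$ when $b \to \infty$.
Choose an arbitrary $\varepsilon > 0$. By (31) and by the fact that $C(d,p)$ is the limit of
$t^{-1}\mathbb{E}\mathcal{L}_1(0, te_1)$, there exists $T > 0$ such that

$$\frac{1}{T}\, \mathbb{E}\mathcal{L}_1(0, Te_1) < C(d,p) + \frac{\varepsilon}{3},$$

and

$$\frac{1}{T}\, \mathbb{E}(2\xi_0(1, b))^p < \frac{\varepsilon}{3},$$

for all $b > 1$. For this fixed $T$, note that $\lim_{b \to \infty} \mathcal{L}_1(0, Te_1; b) = \mathcal{L}_1(0, Te_1)$
monotonically from above almost surely, and by the monotone convergence theorem, there
exists $B > 1$ such that for all $b > B$ and fixed $T$,

$$\frac{1}{T}\, \mathbb{E}\mathcal{L}_1(0, Te_1; b) < \frac{1}{T}\, \mathbb{E}\mathcal{L}_1(0, Te_1) + \frac{\varepsilon}{3}.$$

Combine the three inequalities above with (32),

$$C(d,p;b) \le \frac{1}{T}\left( \mathbb{E}\mathcal{L}_1(0, Te_1; b) + \mathbb{E}(2\xi_0(1, b))^p \right) < C(d,p) + \varepsilon,$$

for all $b > B$. Since restricting the sample points to a tube can only lengthen the shortest
path, $C(d,p;b) \ge C(d,p)$ for every $b$. Therefore $C(d,p;b)$ converges to $C(d,p)$ as $b \to \infty$.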

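Finally, suppose $b = b_t$ is a function of $t$ rather than a constant. If $\liminf_t b_t = \infty$
then

$$C(d,p) \le \lim_{t \to \infty} \frac{1}{t}\, \mathbb{E}\mathcal{L}_1(0, te_1; b_t)
\le \lim_{t \to \infty} \frac{1}{t}\, \mathbb{E}\mathcal{L}_1(0, te_1; B) = C(d,p;B),$$

for every fixed $B > 0$. (30) follows as $B \to \infty$ on the right side. □

4.3 Shortest path size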

In order to prove Lemma 4, we need an exponential probability bound on the upper
tail of $\mathcal{L}_\lambda$.

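Lemma 12. Let $z \in \mathbb{R}^d$, $R_2 > R_1 > 0$, and $\alpha = (d + 2p - 1)^{-1}$. Let $x_\lambda, y_\lambda \in B(z; R_1)$
be functions of $\lambda$ satisfying $\liminf_{\lambda \to \infty} \lambda^\alpha |x_\lambda - y_\lambda| = +\infty$. Fix $\varepsilon > 0$ and let $E_\lambda$
denote the event that

$$\frac{L(x_\lambda, y_\lambda; \mathcal{H}_\lambda \cap B(z; R_2))}{\lambda^{(1-p)/d}\, |x_\lambda - y_\lambda|} \le C(d,p) + \varepsilon. \tag{33}$$

Then

$$\limsup_{\lambda \to \infty} \frac{1}{\lambda^\alpha |x_\lambda - y_\lambda|} \log\left(1 - \mathbb{P}(E_\lambda)\right) < 0. \tag{34}$$

Proof. Let $b_\lambda$ be a positive-valued function of $\lambda$ such that $b_\lambda \to 0$ and $\lambda^{1/d} b_\lambda \to \infty$
as $\lambda \to \infty$. Recall the definitions (26) and (27). Since $b_\lambda \to 0$, for all sufficiently
large $\lambda$,

$$\mathcal{H}_\lambda \cap T(x_\lambda, y_\lambda; b_\lambda) \subset \mathcal{H}_\lambda \cap B(z; R_2),$$

and hence

$$L(x_\lambda, y_\lambda; \mathcal{H}_\lambda \cap B(z; R_2)) \le \mathcal{L}_\lambda(x_\lambda, y_\lambda; b_\lambda).$$

Let $E'_\lambda$ denote the event that (33) holds with $\mathcal{L}_\lambda(x_\lambda, y_\lambda; b_\lambda)$ in place of
$L(x_\lambda, y_\lambda; \mathcal{H}_\lambda \cap B(z; R_2))$. By the inequality above, if $E'_\lambda$ occurs then $E_\lambda$ occurs. Therefore it is
sufficient to show that $1 - \mathbb{P}(E'_\lambda)$, which dominates $1 - \mathbb{P}(E_\lambda)$, is exponentially small.
Fix sufficiently large $\lambda$ so that the inequalities above hold, and set $b = b_\lambda$.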

As in Lemma 11, by the convexity of the power function, $\mathcal{L}_\lambda(0, 2be_1; b)$
may be bounded above by $\mathcal{L}_\lambda(0, be_1; b) + \mathcal{L}_\lambda(be_1, 2be_1; b) + (2^{p-1} - 1)(Z_1^p + Y_0^p)$,
where $Z_k$ and $Y_k$ are the first and the last edge lengths in $\mathcal{L}_\lambda(kbe_1, (k+1)be_1; b)$,
respectively. In Figure 4, when $s = b$ and $s + t = 2b$, $Z_1$ and $Y_0$ correspond to
$|\gamma_+ - \gamma_0|$ and $|\gamma_0 - \gamma_-|$, respectively.

Note that the shortest path for $\mathcal{L}_\lambda(kbe_1, (k+1)be_1; b)$ is not likely to be the direct
path $kbe_1 \to (k+1)be_1$. That is, if it were the direct path, then $\mathcal{H}_\lambda$ would be empty in
$\Theta(kbe_1, (k+1)be_1)$, where $\Theta$ was defined in Lemma 3, and this happens with probability
at most $\exp(-\lambda\theta_0 b^d)$, where $\theta_0 = \theta_0(d,p) > 0$ was also defined in Lemma 3. If none
of the shortest paths for $\mathcal{L}_\lambda(kbe_1, (k+1)be_1; b)$, $k = 0, \ldots, m-1$, is a direct path, then the previous
paste procedure may be repeated so that

$$\mathcal{L}_\lambda(0, mbe_1; b) \le \sum_{k=0}^{m-1} \left( \mathcal{L}_\lambda(kbe_1, (k+1)be_1; b) + (2^{p-1} - 1)(Z_k^p + Y_k^p) \right), \tag{35}$$

with probability at least $1 - m\exp(-\lambda\theta_0(d,p)\, b^d)$.

If $k, l$ are integers and $l - k \ge 3$, then $T(kbe_1, (k+1)be_1; b)$ and $T(lbe_1, (l+1)be_1; b)$
are disjoint, hence $\mathcal{L}_\lambda(kbe_1, (k+1)be_1; b)$ and $\mathcal{L}_\lambda(lbe_1, (l+1)be_1; b)$ are mutually
independent, and so are $Z_k$ and $Z_l$, as well as $Y_k$ and $Y_l$. Then the sum in (35) may be
split into $K \ge 3$ sums of independent variables, and each sum has at most $K^{-1}m$
summands. Note that each summand is almost surely bounded since $Z_k^p + Y_k^p \le
\mathcal{L}_\lambda(kbe_1, (k+1)be_1; b) \le b^p$. Apply Azuma's inequality for $K = 4$ separate
sequences,

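$$\mathbb{P}\left\{ \frac{\mathcal{L}_\lambda(0, mbe_1; b)}{\lambda^{(1-p)/d}\, mb} \ge \mu_b + \varepsilon \right\}
\le m \exp(-\lambda\theta_0 b^d) + 4 \exp\left( -\frac{(m-3)\,\varepsilon^2}{2^{1+2p}\, (\lambda^{1/d} b)^{2(p-1)}} \right), \tag{36}$$

where $\mu_b$ is the expectation $\mathbb{E}\mathcal{L}_\lambda(0, be_1; b) + (2^{p-1} - 1)(\mathbb{E}Z_0^p + \mathbb{E}Y_0^p)$ divided by
$\lambda^{(1-p)/d}\, b$.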

Set $mb = |x_\lambda - y_\lambda|$ and $m = \lfloor \lambda^{(1-\alpha)/d} |x_\lambda - y_\lambda| \rfloor$. By definition, both $\mathbb{E}Z_k^p$ and
$\mathbb{E}Y_k^p$ are bounded above by $\mathbb{E}\xi_t(\lambda, b)^p = \mathbb{E}\xi_0(\lambda, b)^p$ defined in (31), and a direct
computation shows that $\lambda^{(p-1)/d}\, b^{-1}\, \mathbb{E}\xi_0(\lambda, b)^p$ shrinks to zero when $\lambda^{1/d} b \ge \lambda^{\alpha/d} \to
\infty$. See Hwang (2012, Lemma 2.5) for more details. Apply Lemma 11 after scaling
to see that $\mu_b$ converges to $C(d,p)$ as $\lambda \to \infty$. Then (36) becomes



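$$\begin{aligned}
\mathbb{P}\left\{ \frac{\mathcal{L}_\lambda(x_\lambda, y_\lambda; b)}{\lambda^{(1-p)/d}\, |x_\lambda - y_\lambda|} \ge C(d,p) + 2\varepsilon \right\}
&\le \lambda^{(1-\alpha)/d}\, |x_\lambda - y_\lambda|\, e^{-\theta_0 \lambda^\alpha} \\
&\quad + 4 \exp\left( -\frac{\lambda^\alpha |x_\lambda - y_\lambda|\, \varepsilon^2}{2^{1+2p}} \left( 1 + o\left( \frac{1}{\lambda^\alpha |x_\lambda - y_\lambda|} \right) \right) \right)
\end{aligned} \tag{37}$$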

as $\lambda$ and $\lambda^\alpha |x_\lambda - y_\lambda|$ tend to infinity. The claim follows since $|x_\lambda - y_\lambda|$ is bounded
above by $2R_1$, so that the first term of the upper bound has exponential decay in $\lambda^\alpha |x_\lambda - y_\lambda|$
as well. □
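One way to see why the exponent $\alpha = (d + 2p - 1)^{-1}$ appears is to track powers of $\lambda$. With $m$ of order $\lambda^{(1-\alpha)/d}|x_\lambda - y_\lambda|$ segments of length $b$ of order $\lambda^{-(1-\alpha)/d}$, an Azuma-type exponent proportional to $m\,(\lambda^{1/d} b)^{2(1-p)}$ is of order $\lambda^\alpha |x_\lambda - y_\lambda|$ exactly for this choice of $\alpha$, and $\lambda b^d$ is of order $\lambda^\alpha$ as well. The check below uses exact rational arithmetic; it reflects a reading of the argument made for this sketch, and the function name is hypothetical.

```python
from fractions import Fraction

def lambda_exponents(d, p):
    """Powers of lambda in the two exponential terms when b ~ lambda^(-(1-alpha)/d).

    Returns (azuma_power, direct_path_power, alpha), where
      azuma_power       is the power of lambda in m * (lambda^(1/d) * b)^(2(1-p)),
      direct_path_power is the power of lambda in lambda * b^d,
      alpha             is 1 / (d + 2p - 1).
    """
    alpha = Fraction(1, 1) / (d + 2 * p - 1)
    power_of_m = (1 - alpha) / d          # m ~ lambda^((1-alpha)/d) * |x - y|
    power_of_scaled_b = alpha / d         # lambda^(1/d) * b ~ lambda^(alpha/d)
    azuma_power = power_of_m + 2 * (1 - p) * power_of_scaled_b
    direct_path_power = 1 - d * power_of_m
    return azuma_power, direct_path_power, alpha
```

For instance, $d = 2$ and $p = 2$ give $\alpha = 1/5$, and both returned powers equal $1/5$.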

Proof of Lemma 4. Fix constants $A > 1$ and $0 < A' < 1$. Let $N$ and $N'$ be
independent Poisson variables with mean $nA$ and $nA'$, respectively. Let $H_n$ denote
the event that $N \ge n$ and $N' \le n$. Let $a = A/(V_d R_2^d)$ and $a' = A'/(V_d R_2^d)$ where
$V_d$ denotes the volume of the unit ball $B(z; 1)$. In other words, $a$ (or $a'$) is $A$ (or $A'$)
divided by the volume of $B(z; R_2)$. Let $K_n$ denote the event that

$$\frac{L(x_n, y_n; \mathcal{H}_{na'} \cap B(z; R_2))}{(na')^{(1-p)/d}\, |x_n - y_n|} \le C(d,p) + \frac{\varepsilon}{2}. \tag{38}$$

We first show that if both $H_n$ and $K_n$ occur, then the following conditions are
satisfied.

(i) $L_n(x_n, y_n)$ is a path in $\mathcal{H}_{na}$.

(ii) $L_n(x_n, y_n) \le (C(d,p) + \varepsilon/2)\, (na')^{(1-p)/d}\, |x_n - y_n|$.

Note that the restriction of $\mathcal{H}_{na}$ to $B(z; R_2)$ may be realized as $X_N$ since $na =
nA/(V_d R_2^d)$. Since $H_n$ is assumed to occur, it follows that $N \ge n$, and $X_n \subset X_N =
\mathcal{H}_{na} \cap B(z; R_2) \subset \mathcal{H}_{na}$. Therefore $L_n(x_n, y_n) = L(x_n, y_n; X_n)$ is a path in $\mathcal{H}_{na}$, and
condition (i) holds.

For condition (ii), $H_n$ is assumed to occur, so we have $N' \le n$. Then, similarly
to the previous argument, $\mathcal{H}_{na'} \cap B(z; R_2) = X_{N'} \subset X_n$ and it follows that
$L(x_n, y_n; X_n) \le L(x_n, y_n; \mathcal{H}_{na'} \cap B(z; R_2))$. Condition (ii) follows by (38).
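The event $H_n$ used in the two arguments above fails only with exponentially small probability in $n$. For a Poisson variable with mean $na$, the standard Chernoff bound gives $\exp(-n(a - 1 - \log a))$ for the lower tail at $n$ when $a > 1$ and for the upper tail at $n$ when $0 < a < 1$. The sketch below evaluates this bound and the exact Poisson probabilities; the function names are assumptions made here.

```python
import math

def poisson_pmf(k, mean):
    """P(N = k) for N ~ Poisson(mean), evaluated in log space for stability."""
    return math.exp(-mean + k * math.log(mean) - math.lgamma(k + 1))

def chernoff_bound(n, a):
    """exp(-n (a - 1 - log a)).

    For N ~ Poisson(n * a) this bounds P(N <= n) when a > 1
    and P(N >= n) when 0 < a < 1.
    """
    return math.exp(-n * (a - 1.0 - math.log(a)))

def prob_Hn_fails_bound(n, big_a, small_a):
    """Union bound on 1 - P(H_n), where H_n = {N >= n and N' <= n}."""
    return chernoff_bound(n, big_a) + chernoff_bound(n, small_a)
```

For example, with $A = 1.5$ and $n = 10$ the bound on $\mathbb{P}\{N \le n\}$ is $\exp(-10(0.5 - \log 1.5)) \approx 0.39$, and it decays geometrically as $n$ grows.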

$1 - \mathbb{P}(H_n)$ is exponentially small in $n$ by the Chernoff bound, and $1 - \mathbb{P}(K_n)$ is
exponentially small in $(na')^\alpha |x_n - y_n|$ by Lemma 12. Note that $f(z) = 1/(V_d R_2^d)$
and $A'$ is a fixed constant in this proof, hence $1 - \mathbb{P}(K_n)$ is exponentially small
in $(nf(z))^\alpha |x_n - y_n|$. Lastly, when $H_n$ and $K_n$ occur, we have shown that (i) and
(ii) hold, and an application of Lemma 10 shows that $1 - \mathbb{P}(G_n)$, the probability
that $\#L_n(x_n, y_n)$ is greater than $C_*(nf(z))^{1/d} |x_n - y_n|$, is exponentially small in
$(nf(z))^{1/d} |x_n - y_n|$ when $C_* > (C(d,p) + \varepsilon/2)\rho_0$. (See Lemma 10 for $\rho_0$.) The decay
rate of $1 - \mathbb{P}(G_n)$ is determined by the slower one, i.e., $1 - \mathbb{P}(K_n)$, and the lemma
holds. □

4.4 Mean convergence in i.i.d. cases

Proof of Proposition 5. Fix $x_n$ and $y_n$ so that $L_k$ denotes $L_k(x_n, y_n)$ (not $L_k(x_k, y_k)$)
for all $k \ge 0$. Let $C_* > 0$ be as in Lemma 4 and suppose that the number of nodes
$\#L_n$ in the shortest path $L_n$ is less than $C_*(nf(z))^{1/d} |x_n - y_n|$. Suppose that the
event $F_n$ from Lemma 3 occurred and all the shortest path edge lengths are at
most $(nf(z))^{(\alpha-1)/d}$. When a sample point from $X_n$ is discarded, $L_{n-1}$ remains
the same as $L_n$ if the discarded sample point was not a node in $L_n$. Furthermore,
since edge lengths are at most $(nf(z))^{(\alpha-1)/d}$, $L_{n-1}$ and $L_n$ may differ at most by
$2^p (nf(z))^{p(\alpha-1)/d}$. Therefore

$$\mathbb{E}L_{n-1} - \mathbb{E}L_n \le \frac{C_*(nf(z))^{1/d}\, |x_n - y_n|}{n} \cdot 2^p (nf(z))^{p(\alpha-1)/d} + h_n\, \mathbb{E}L_0, \tag{39}$$

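where $h_n$ denotes the probability that either $\#L_n > C_*(nf(z))^{1/d} |x_n - y_n|$, or the
event $F_n$ does not occur. The factor $\mathbb{E}L_0$ in the last term is chosen because $\mathbb{E}L_k \le \mathbb{E}L_0$ for all
$k \ge 0$. Let $N$ be a Poisson variable with mean $n$. Write

$$\mathbb{E}L_N = \sum_{k \ge 0} \mathbb{E}L_k\, \mathbb{P}\{N = k\}.$$

The difference $|\mathbb{E}L_n - \mathbb{E}L_N|$ is bounded above by

$$\sum_{k \ge 0} |\mathbb{E}L_n - \mathbb{E}L_k|\, \mathbb{P}\{N = k\}
\le \mathbb{E}L_0\, \mathbb{P}\{N \le 2^{-1}n\} + \sum_{k > 2^{-1}n} |\mathbb{E}L_n - \mathbb{E}L_k|\, \mathbb{P}\{N = k\}.$$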

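Note that the first term on the right of (39) decreases monotonically in $n$ for fixed $x_n$
and $y_n$. Therefore if $k > 2^{-1}n$,

$$|\mathbb{E}L_n - \mathbb{E}L_k| \le 2^p C_*\, |x_n - y_n|\, |n - k| \left(\frac{n}{2}\right)^{-1} \left(\frac{nf(z)}{2}\right)^{\frac{1 + p(\alpha - 1)}{d}} + \mathbb{E}L_0 \sum_l h_l,$$

and since $\mathbb{E}|N - n| \le \sqrt{n}$ and $\mathbb{E}L_0 = |x_n - y_n|^p$,

$$\frac{|\mathbb{E}L_n - \mathbb{E}L_N|}{(nf(z))^{(1-p)/d}\, |x_n - y_n|}
\le O\left( (nf(z))^{p\alpha/d}\, n^{-1/2} \right) + \frac{\mathbb{P}\{N \le 2^{-1}n\} + \sum_l h_l}{(nf(z))^{(1-p)/d}\, |x_n - y_n|^{1-p}},$$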

where the summation $\sum_l h_l$ is still over $l > 2^{-1}n$. The first term on the right decays
to zero since $\alpha < d/(2p)$. The second term also decays to zero since, while the
denominator has at most polynomial decay in $(nf(z))^\alpha |x_n - y_n|$, the numerator has
exponential decay by the Chernoff bound, Lemma 4, and Lemma 3. Note that $X_N$
is identically distributed as $\mathcal{H}_{n/(V_d R_2^d)} \cap B(z; R_2)$, where $V_d$ is the volume of the unit ball
$B(z; 1)$, and the proposition follows by Lemma 11 after a scale adjustment. □
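The Poissonization step above uses $\mathbb{E}|N - n| \le \sqrt{n}$ for a Poisson variable $N$ with mean $n$, which follows from the Cauchy-Schwarz inequality since $\operatorname{Var}(N) = n$. The sketch below computes the mean absolute deviation directly from the probability mass function; the truncation rule and the function name are choices made here for illustration.

```python
import math

def poisson_mean_abs_deviation(n, tail=12):
    """E|N - n| for N ~ Poisson(n), summing the pmf up to n + tail*sqrt(n) + tail.

    The truncated sum is a slight underestimate; the neglected tail is
    negligible for the default value of `tail`.
    """
    upper = int(n + tail * math.sqrt(n) + tail)
    return sum(abs(k - n) * math.exp(-n + k * math.log(n) - math.lgamma(k + 1))
               for k in range(upper + 1))
```

For integer $n$ the value agrees with the closed form $2e^{-n} n^{n+1}/n!$, which is close to $\sqrt{2n/\pi}$ for large $n$ and therefore within the bound $\sqrt{n}$.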